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Abstract — In this work, we study the problem of power 
allocation and adaptive modulation in teams of decision makers. 
We consider the special case of two teams with each team 
consisting of two mobile agents. Agents belonging to the same 
team communicate over wireless ad hoc networks, and they try 
to split their available power between the tasks of communication 
and jamming the nodes of the other team. The agents have 
constraints on their total energy and instantaneous power usage. 
The cost function adopted is the difference between the rates 
of erroneously transmitted bits of each team. We model the 
adaptive modulation problem as a zero-sum matrix game which 
in turn gives rise to a a continuous kernel game to handle 
power control. Based on the communications model, we present 
sufficient conditions on the physical parameters of the agents 
for the existence of a pure strategy saddle-point equilibrium 
(PSSPE). 

I. Introduction 

The decentralized nature of wireless ad hoc networks makes 
them vulnerable to security threats. A prominent example of 
such threats is jamming: a malicious attack whose objective 
is to disrupt the communication of the victim network inten- 
tionally, causing interference or collision at the receiver side. 
Jamming attack is a well-studied and active area of research 
in wireless networks. Unauthorized intrusion of such kind 
has initiated a race between the engineers and the hackers; 
therefore, we have been witnessing a surge of new smart 
systems aiming to secure modem instrumentation and software 
from unwanted exogenous attacks. 

The problem under consideration in this paper is inspired 
by recent discoveries of jamming instances in biological 
species. In a series of playback experiments, researchers have 
found that resident pairs of Peruvian warbling antbirds sing 
coordinated duets when responding to rival pairs. But under 
other circumstances, cooperation breaks down, leading to more 
complex songs. Specifically, it has been reported that females 
respond to unpaired sexual rivals by jamming the signals of 
their own mates, who in turn adjust their signals to avoid the 
interference [14|. 

Ad hoc networks consist of mobile energy-constrained 
nodes. Mobility affects all layers in a network protocol stack 
including the physical layer as channels become time-varying 
[[7j. Moreover, nodes such as sensors deployed in a field or 
military vehicles patrolling in remote sites are often equipped 
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with non-rechargeable batteries. Power control and adaptive 
resource allocation (RA) play, therefore, a crucial role in 
designing robust communications systems. At the physical 
layer, power control can be used to maximize rate or mini- 
mize the transmission error probability, see [2|, [8 1 and the 
references therein. In addition, in multi-user networks, power 
control can be used to regulate the interference level at the 
terminals of other users [4], [12] , [15,1 . Due to the lack of 
a centralized infrastructure in ad hoc networks, distributed 
solutions are essential. In this work, similar to [^, (12], [TSl, 
we model the power allocation problem as a noncooperative 
game, which allows us to devise a non-centralized solution. As 
a departure from previous research, however, the power control 
mechanism we propose splits the power budget of each player 
into two portions: a portion used to communicate with team- 
mates and a portion used to jam the players of the other team. 
More importantly, the objective function is chosen to be the 
difference between the cumulative bit error rate (BER) of each 
team; this allows for increased freedom in choosing physical 
design parameters, besides the power level, such as the size 
of modulation schemes. 

Adaptive RA mechanisms involve varying physical layer 
parameters according to channel, interference, and noise con- 
ditions in order to optimize a specific metric, such as spec- 
tral efficiency. Adapting the modulation scheme, choosing 
coding schemes, and controlling the transmitter power level 
are examples of adaptive RA schemes. Goldsmith and Chua 
demonstrated in [6[ that adaptive RA provides five times more 
the spectral efficiency of nonadaptive schemes. In this work, 
we propose an adaptive modulation scheme based on a zero- 
sum matrix game played by both teams. 

The conflicting objectives of the two teams entails the use of 
game-theoretic framework to study this problem. We identify 
three main tasks that each team needs to perform: 

1) Optimal trajectory: computing the optimal motion path 
for the agents 

2) Power Allocation: dividing power between internal com- 
munications and jamming 

3) Adaptive Modulation: choosing an appropriate modula- 
tions scheme 

We addressed Task 1 in [|3] by posing the problem as a 
pursuit-evasion game. The optimal strategies of the players are 



obtained by using techniques from differential game theory. 
We also addressed Task 2 in |31 using continuous kernel static 
games; we will generalize the formulation in this work to 
include a minimum rate constraint. This will lead to a more 
practical scheme as it guarantees a non-zero communications 
rate. Task 3 will be addressed in this work using static matrix 
games. The saddle-point equilibrium of the power allocation 
problem is parametrized by the modulation schemes of the two 
teams. We therefore introduce a third game in order to arrive 
at the equilibrium modulation schemes. In fact, this gives rise 
to a games-within-games structure: the optimal trajectory is 
first found, power allocation is then performed, and finally the 
optimal modulation is computed at each time instant. 

The main contributions of this paper are as follows. We 
introduce a third layer of games to our recent work [l3l in 
order to perform adaptive modulation. We also generalize 
the power allocation problem introduced in [3 | to ensure 
non-zero communications rate. We introduce an optimization 
framework taking into consideration constraints in energy and 
power among the agents. Moreover, we relate the problem 
of optimal power allocation for communication and jamming 
to the communication model between the agents. Finally, we 
provide a sufficient condition for existence of an optimal 
decision strategy among the agents based on the physical 
parameters of the problems. 

The rest of the paper is organized as follows. We formu- 
late the problem in Section and explain the underlying 
notation. The saddle-point equilibrium properties of the team 
power control problem are studied in Section |III] with the 
specific example of systems employing uncoded M-quadrature 
amplitude modulations (QAM). We introduce our adaptive 
modulation scheme in Section |lVl Simulation results are 
presented in Section |V] We conclude the paper and provide 
future directions in Section |VI] 

II. Problem Formulation 

Consider two teams of mobile agents. Each agent is com- 
municating with members of the team it belongs to, and, at the 
same time, jamming the communication between members of 
the other team. In particular, each team attempts to minimize 
its own BER while maximizing the BER of the other team. 
We consider a scenario where each team has two members, 
though at a conceptual level our development applies to higher 
number of team members as well. Team A is comprised of 
the two players {1°, 2"} and Team B is comprised of the two 
players {1^2''}. We assume that fa and ft are the frequencies 
at which Team A and Team B communicate, respectively, and 

fa fb- 

Naturally, for an initial position Xq G X, the outcome of the 
game tt, is given by the difference in the BERs of both teams 
during the entire course of the game. Formally: 

^(xo,<,u^^) ^N- r [pl{t) +Pm-P\{t)-P2{t)] dt, 
Jo , ' 
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where p°;{t) and p^{t) are the BERs of agent i in Team A 
and and agent j in Team B, respectively; uf and are 
likewise the control inputs of agents i and j in teams A and B, 
respectively; N is the total number of transmitted bits which 
remains constant throughout the game; and T is the time of 
termination of the game. We conclude that the objective of 
Team A is to minimize tt, whereas that of Team B is to 
maximize it. 

Since the agents are mobile, there are limitations on the 
amount of energy available to each agent that is dictated 
by the capacity of the power source carried by each agent. 
The game is said to terminate when any agent runs out of 
power Let P°^{t) and Pj{t) denote the instantaneous power 
for communication used by player i in Team A and player 
j in Team B, respectively. We model this restriction as the 
following integral constraint for each agent: 

/ P,{t)dt<E (1) 

For each transmitter and receiver pair, we assume the 
following communications model in the presence of a jammer 
which is motivated by ifTSl . Given that the transmitter and 
the receiver are separated by a distance d, and the transmitter 
transmits with constant power Pt, the received signal power 
Pa is given by 

Pb = pPrd-", (2) 

where a is the path-loss exponent and p depends on the 
antennas' gains. Typical values of a are in the range of 2 
to 4. According to the free space path loss model, p is given 
by: 

GtGb,X 

(4^)2 ' 

where A is the signal's wavelength and Gt, Gr are the 
transmit and receive antennas' gains, respectively, in the line of 
sight direction. In real scenarios, p is very small in magnitude. 
For example, using nondirectional antennas and transmitting 
at 900 MHz, we have p = = 6.896 x lO"-*. 

The received signal-to-interference ratio (SINR) s is given 

by 



where cr^ is the power of the noise added at the receiver, and 
/ is the total received interference power due to jamming and 
is defined as in (|2]i. 

Let Pi{t) and P^{t) denote the instantaneous power levels 
for communication used by player i in Team A and player j 
in Team B, respectively. Since the agents are mobile, there are 
limitations on the amount of energy available to each agent 
that is dictated by the capacity of the power source carried by 
each agent. We model this restriction as the following integral 
constraint for each agent 

/ P,{t)dt<E. (4) 
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In addition to the energy constraints, there are limitations on 
the maximum power level of the devices that are used onboard 
each agent for the purpose of communication. For each player, 
this constraint is modelled by < P°'{t),P^{t) < Pmax- 

We also assume that players of each team have access to 
different M-QAM modulation schemes. We denote the set of 
available modulation sizes to the players in Team A by 
and that available to players of Team B by Ai^. The sizes of 
the employed QAM modulation by the teams are M° e 
and £ Ai^. We assume that Team A can choose among 
n different modulation schemes, and Team B chooses from a 
set of m different schemes, i.e., \A4°-\ = n, \A4^\ = rn. 

The instantaneous BER depends on the SINR, the modu- 
lation scheme, and the error control coding scheme utilized. 
Communications literature contains closed-form expressions 
and tight bounds that can be used to calculate the BER when 
the noise and interference are assumed to be Gaussian (5]. 
For uncoded M-QAM, where Gray encoding is used to map 
the bits into the symbols of the constellation, the BER can be 
approximated by lilll 



TABLE I 

Decision variables and distances among agents. 



(5) 



where r = log(M), ( 4(1 - l/VM), P = 3/(M - 1), 
and is the tail probability of the standard Gaussian 

distribution. 

To ensure a non-zero communication rate between the 
agents of each team, we impose a minimum rate constraint 
for each agent: R°;{t), R\{t) > R, where ^ > is a threshold 
design rate, which we assume is the same for all agents. The 
results can be readily extended to networks of players having 
a different value of the minimum design rate. 

At every instant, each agent has to decide on the fraction 
of the power that needs to be allocated for communication 
and jamming. Table |l] provides a list of decision variables for 
the players, which models the power allocation. Each decision 
variable is a non-negative real number and lies in the interval 
[0, 1]. The decision variables belonging to each row add up to 
one. The fraction of the total power allocated by the player 
in row i to the player in column j is given by the first entry 
in the cell This allocated power is used for jamming if 

the player in column j belongs to the other team; otherwise, 
it is used to communicate with the agent in the same team. 
Similarly, the distance between the agent in row i and the agent 
in column j is given by the second entry in cell (i, j). Since 
distance is a symmetric quantity, — dP'' and d^^ ~ d^^. 
Fig. [1] depicts the power allocation between the members of 
the same team as well as between members of different teams. 
In addition to power allocation, each team has to decide on 
the size of the QAM modulation to be used. In summary, 
each agent has to compute the following decision variables 
at each instance in accordance with the above tasks: (i) the 
instantaneous control (Task 1); (ii) the instantaneous power 
level, Pi{t) (Task 2); (iii) all the decision variables present in 
the row corresponding to the agent in Table U (Task 2); and 
(iv) the size of the QAM schemes, M° or M** (Task 3). 
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Fig. 1. Power allocation among the agents for communication as well as 
jamming. 



III. Power Allocation 

From (|3]l, the received SINR and the rate achieved by each 
agent are given by the following expressions 



PaPnt)j^Hd'')-" 



p,P^{t)S{{d\)-'- + pbP^{t)S^,idl)- 



p,pHt)s,Mj) 



log(l + s,°), R\^\og{l + s)) 



(6) 



Agents of the same team embark in a team problem which 
eliminates their need to exchange information about their 
decision variables. Further, since agents in different teams 
do not communicate, they possess information only about 
their own decision variables. This makes the power allocation 
problem a continuous kernel zero-sum game between the 
teams: 

Team A: The objective of each agent is to minimize L. 



min LiM", M^) ^ min 



y,~p\-pl) (7) 



subject to: < < P„ax, > R 

71+72+7''^' = 1, 71,72,7'^' >0 
Team B: The objective of each agent is to maximize L. 



max UW^M") 



subject to: < P^{t) < Pn 
5} + 5j + S,,, = 



^^max^ (8) 



ax, Ri > R 

1, 6lSl6,,>0 



Note that the power allocation vector for 1" denoted by 
7 — (7^^,71,72) belongs to the intersection between the 
three-dimensional simplex A'^ and the plane ro, where ro = 
{7|7^^ = ^(2^ ~ !)}■ The power allocation vectors of other 
players belong to similar sets. 

In [3 1, we showed that the optimal value of the power 
consumption for each player is Pmax- We also showed that the 
entire game terminates in a fixed time T — irrespective 
of the initial position of the agents. Moreover we provided 
a sufficient condition for the existence of a pure-strategy 
saddle-point equilibrium (PSSPE) for the power allocation 
game when uncoded M-QAM schemes are used by all agents. 
Here, we modify the condition to allow teams to use different 
modulation schemes as made formal by the next theorem. 

Theorem 1: When all players employ uncoded 7\f-QAM 
modulation schemes, the power allocation team game formu- 
lated above has a unique PSSPE solution if the following 
condition is satisfied: 



p 



M<^ - 1 ' M^-l 



(9) 



For the special case of Af " = M'' = M and pb 
condition becomes 



Pa = p, the 



l3pP„ 



iK',di2})' 



(10) 



The proof is similar to that presented in [T| and is omitted 
here. Note that the left hand side of inequality ( fTOl i depends 
entirely on physical design parameters; this is of particular 
importance for design purposes. Moreover, we showed in |]3] 
that this condition can be expressed in terms of the received 
signal-to-noise-ratios (SNRs) for all players, which could be 
more insightful from a communication systems perspective. 
Consider, for example, 1", and let SNR^ = Prr^a^i^pidD 



and SNR. 
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-. We then have: 

u ~ 

|(SNR| + 1) z,j e{l,2};t^j 

P 



Yet another useful way to interpret condition ( fTOt is regarding 
it as a minimum rate condition: 



r > loa 



^ ^ pPrnax (min{d^^ , di2 }) 



Assuming (fTOl i holds, the objective function is strictly 
convex in the decision variables of 1°, 2'^ and strictly concave 



in the decision variables of l**, 2^. A unique globally optimal 
solution (7) therefore exists, which we characterize using the 
KKT conditions 19]. Consider, for example, the case of 1". 
The expressions for SINR provided in (|6]l relevant to the 
optimization problem being solved by 1° can be written in 
a concise form as shown below: 



S2 = 
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Pn^ 
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The KKT conditions can then be written as: 

4 



VL?(7) + ^»V;i,(7) + ryV/i(7) = 0, 

1=1 

A,/i,(7) = 0, A„7?>0, {1,2,3} 



(11) 



where 



/ii(7) 

m 



-7^^ + min • 



-,1 ? < 



-7i < 0, /i3(7) = -72 < 
7''+7i+72-l = 



Now, we present the necessary and sufficient conditions for 
the solution to the optimization problem for the agents. Let 
us consider the case of 1°. The assumptions in Theorem 4 
regarding strict convexity of L\ render the KKT conditions 
to be necessary as well as sufficient conditions for the unique 
global minimum. 

To this end, we obtain: 



V/ii(7) 



ai5'(si) 



,V/i2(7) 
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,V/i3(7) 



Since 7 e A'^flro, at most three of the constraints can be active 
at any given point. Hence, the gradient of the constraints at 
any feasible point are always linearly independent. 

If two of the three constraints among {/ii, /12, ^3} are active, 
then 7 has a unique solution that is given by the vertex of the 
simplex that satisfies the two constraints. If only one of the 
constraints among {/ii,/i2,^3} is active, then we have the 
following cases depending on the active constraint 





M^{1) 




M''{m-1) 




M''{m) 








L{M''il),M''{2)) 


L{M''{l),M''{m~ 


1)) 


L{M''{l),M''{m)) 




M''(2) 


L(M°'(2\M''(1)) 


L(7W 2), A^" 2)) 


L(A^"(2),7W(m- 


1)) 


L(M°-(2),M''(m)) 


Team A 




L{M''{n),M\l)) 


L{M''{n),M\2)) 


i(X''(n),X^(m- 


1)) 


L(X"(n),7W''(m)) 










Team B 






(12) 



1) /ii(7i) = 0: 7' = (7'^7i'*,l 
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in I ^ ^ , l|, satisfies the equation 



7-*-^), where 



[ei + (1 - 7!* - 7I2)] 



ci + 7( 



1*12 



2) /i2(7') = 0: 72 = 
following equation 

3) /i3(7') = 0: 73 = 
following equation 



(13) 



(1 - 72*, 0,72*) satisfies the 

(ei +72 ) 
(1 - 7|*,7|*,0) satisfies the 



(15) 



If none of the inequality constraints are active, then 7** 

(i-7i 



1* 



72 



1* 



7^^,7i*,72*) is the solution to: 



aig'{s2) 
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M23'(^?) = 

g'{4) = o 



(16) 



Here, 7 lies in the set 

{(1,0,0), (0,1,0), (0,0,1), 71,72, 73, 7-*}. An important 
point to note is that ai,bi,ci,di and ei depend on the 
decisions of the other players. Therefore, the computation of 
the decision variables depend on the value of the decision 
variables of the rest of the players. A possible way to deal 
with this problem is to use iterative schemes for computation 
of strategies. [1| provides some insights into the efficacy of 
such schemes from the point of view of convergence and 
stability. In this work, we assume that each agent has enough 
computational power so as to complete these iterations in a 
negligible amount of time compared to the total horizon of 
the game. The specific conditions for 1° corresponding to 
([TsT l-dTSll when M-QAM modulations are utilized are: 
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Also, ( fT6] l in this case corresponds to solving ( fTSl l and (fT9] l 
jointly. 

IV. Adaptive Modulation 

The time-varying nature of the channels due to mobility 
emphasizes the need for robust communications. Adaptive 
modulation is a widely used technique as it allows for choosing 
the design parameters of a communications system to better 
match the physical characteristics of the channels in order 
to optimize a given metric such as: minimizing BER or 
maximizing spectral efficiency. In this work, we model the 
adaptive modulation as a matrix zero-sum game between the 
two teams. We therefore look for an equilibrium solution 
which would dictate what modulations should be adopted 
by the teams at each time instant. The competitive nature 
of the jamming teams makes our approach to the problem 
most practical as any other non-equilibrium solution cannot 
produce an improved outcome, relative to that yielded by the 
equilibrium, for any of the teams. 

The matrix game is given in (fT2] i. The rows are all the 
possible actions for players of Team A, and the columns 
are the different options available for Team B. The {i,j)-th 
element of the matrix is the value of the objective function L 
when Team A employs A/" = A4"'{i), and Team B employs 
Aff- = M''{j). 

A PSSPE does not always exist for the power alloca- 
tion game. The condition for the existence of a PSSPE is 
mini maxj Aij ~ maxj min^ A^^ |[T|. In case a PSSPE does 
not exist, we need to look for a solution in the larger class of 
mixed-strategies. A pair of strategies {A/°*,Af''*} is said to 
be a a mixed- strategy saddle point equilibrium (MSSPE) for 
the matrix game if |[T] 

(Af"*)^AAf'' < {M''*fAM''* < {M^fAM^* 

For an n X TO matrix game, the following theorem from |T1, 
which we state without proof, establishes the existence of an 
MSSPE for the adaptive modulation game. 

Theorem 2: The adaptive modulation game admits an 
MSSPE. 

In case multiple MSSPEs exist, the following corollary 
becomes essential IT]. 

Corollary 1: If {M''{ii),M^{ji)} and 

{M''{i2),M^{j2)} are two MSSPEs of the adaptive 
modulation game, then {A^'^(ii), A^''(j2)} and 
{M''[i2)M^{3i)} are also MSSPEs. 



This is termed the ordered interchangeability property and its 
importance Hes in that it removes any ambiguity associated 
with the existence of multiple equilibrium solutions as the 
teams do not need to communicate to each other which 
equilibrium solution they will be adopting. Literature contains 
different efficient low-complexity algorithms that computes 
MSSPEs for matrix games, such as Gambit ifTOl . We refer 
the interested reader to [Tl for a discussion of some of 
these approaches. Section [V] illustrates these concepts and 
shows how the choice of modulation size changes with the 
characteristics of the environment. Finally, it is assumed that 
players of each team communicate their modulation choices 
among themselves through a reliable side channel. 

V. Simulations Results 

To better understand the adaptive modulation scheme, 
we present the following example. Let ~ — 

{16,64,265}, dl = 17.7864, d] = 15.3376, dj = 19.8951, 
dl = 14.1128, ^ 20.6309, di2 = 26.3224, pa = 0.0570, 
Pb = 0.0517, P,nax = 100, cr2 = 10"^ ^ = 1, and a = 2. 
The matrix A corresponding to these values was found to be: 
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0.0533 


0.1229 


A = 64 


-0.0356 


0.0091 


0.0728 


256 


-0.1155 


-0.0677 


0.0040 



Team A 



Team B 



Note that the third row dominates the other rows strictly, and 
there is a unique PSSPE given by {256, 256} in this case. 

Fig. [2] depicts how the teams adapt their modulation scheme 
relative to the SNR, which we define as Pmax/cr^ ■ The set of 
modulations available to each team is Al = {16,20,24,28}. 
Players 1", 2", and l** were placed close to each other, while 
2^ is far from all of them; in particular: d\ ~ 2.2036, d\ — 
33.6830, dl = 2.4211, dl = 33.6393, d^^ = 4.5607, dil = 
33.2022. We also let pa = 0.0570, pb = 0.0517, P,nax = 1, 
R = 1, a = 3, and varied the noise variance at all receivers to 
simulate the presented SNR range. We observe that both teams 
switch to a constellation of a smaller size at SNR= 50 dB. 
This is due to both teams switching from pure communications 
to perform both communication and jamming. In order to do 
so, they both switch to a smaller constellation size which will 
guarantee robust communications for them as they will allocate 
some power to jam. 

VI. Conclusion and Future Work 

This paper has studied the power allocation problem for 
jamming teams. An underlying static game was used to obtain 
the optimal power allocation, where the power budget of each 
user is split between communication and jamming powers. 
A separate matrix game was utilized in order to arrive at the 
optimal modulation schemes for each team. This work focused 
on the analysis of teams consisting of two players only; a 
potential future direction is to generalize the results to teams 
consisting of multiple agents. Moreover, future work will con- 
sider scenarios of players possessing incomplete information 
and study the problem in the context of Bayesian games. 
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Fig. 2. Adaptive Modulation 
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